Abstract

We consider online scheduling on multiple machines for jobs arriving one-by-one with 
the objective of minimizing the makespan. For any number of identical parallel or uniformly 
related machines, we provide a competitive-ratio approximation scheme that computes an 
online algorithm whose competitive ratio is arbitrarily close to the best possible competitive 
ratio. We also determine this value up to any desired accuracy. This is the first application 
of competitive-ratio approximation schemes in the online-list model. The result proves the 
applicability of the concept in different online models. We expect that it fosters further 
research on other online problems. 

1 Introduction 

Online scheduling problems have been studied extensively for more than two decades [22, 24]. One
of the most extensively investigated problems among them is the makespan minimization problem 
with jobs arriving one-by-one: We are given m identical parallel machines, and we assume 
throughout the paper that m is an arbitrary but fixed constant. The set of jobs $J = \{1, 2, \ldots\}$
with integral processing times $p_j \ge 1$ ($j \in J$) is presented to the online algorithm one after the
other. Once a job is present, it must be assigned without splitting, immediately, and irrevocably
to a machine before the next job is revealed. This model for revealing online information
one-by-one is called the online-list model [22]. The goal is to minimize the makespan, that is, the
last completion time of all currently present jobs. Using the standard three-field notation [17],
we denote the problem as the online-list variant of $Pm||C_{\max}$. We also consider the more general
model of uniformly related machines $Qm||C_{\max}$, where each machine $i \in \{1, \ldots, m\}$ is given a
speed $s_i$ and the execution time of job j on machine i is $p_j/s_i$.
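
To make the online-list model concrete, the following minimal sketch processes jobs in arrival
order and fixes each assignment before the next job is seen. The greedy rule used here (place the
job where it would finish first) is only an illustrative choice and is not the scheme developed in
this paper; the function names `list_schedule` and `makespan` are hypothetical.

```python
from typing import List, Sequence


def list_schedule(jobs: Sequence[float], speeds: Sequence[float]) -> List[int]:
    """Assign each job, in arrival order, to the machine on which it finishes first."""
    loads = [0.0] * len(speeds)  # current completion time of every machine
    assignment = []
    for p in jobs:  # jobs are revealed one by one
        # Irrevocable decision: minimize the completion time of the new job.
        i = min(range(len(speeds)), key=lambda i: loads[i] + p / speeds[i])
        loads[i] += p / speeds[i]  # execution time of the job on machine i is p / s_i
        assignment.append(i)
    return assignment


def makespan(jobs: Sequence[float], speeds: Sequence[float], assignment: Sequence[int]) -> float:
    """Last completion time over all machines for a given assignment."""
    loads = [0.0] * len(speeds)
    for p, i in zip(jobs, assignment):
        loads[i] += p / speeds[i]
    return max(loads)
```

With unit speeds the rule reduces to placing each job on a currently least loaded machine; for
example, the jobs 1, 1, 2 on two identical machines end with makespan 3, whereas an offline
schedule achieves 2.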

The performance of online algorithms is typically assessed by competitive analysis [20, 25],
which determines the worst-case performance compared to an optimal offline algorithm. Let
an instance I be defined by a set of jobs J with processing times $p_j$ ($j \in J$), and m, the
number of available machines, and let $\mathcal{I}_m$ be the set of instances with m machines. We
call an online algorithm $\rho(m)$-competitive if, for any problem instance $I \in \mathcal{I}_m$, it achieves a
solution with cost $\mathrm{ALG}(I) \le \rho(m) \cdot \mathrm{OPT}(I)$, where ALG(I) and OPT(I) denote the solution
value of the online and an optimal offline algorithm, respectively, for the same instance I. The
competitive ratio $\rho_{\mathrm{ALG}}(m)$ of ALG is the infimum over all $\rho$ such that ALG is $\rho$-competitive.
The minimum competitive ratio $\rho^*(m)$ achievable by any online algorithm for instances in $\mathcal{I}_m$
is called optimal for m machines. The optimal competitive ratio over all numbers of machines
is $\rho^* := \max_{m \in \mathbb{N}} \rho^*(m)$.
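
As a small illustration of competitive analysis, the sketch below compares a greedy online rule
with an exhaustively computed offline optimum on identical machines. It only yields a lower bound
on the competitive ratio of that rule from one instance; the helper names and the choice of the
greedy rule are assumptions made for this example, and the brute force is exponential in the
number of jobs.

```python
from itertools import product


def opt_makespan(jobs, m):
    """Optimal offline makespan on m identical machines by exhaustive search."""
    best = float("inf")
    for assignment in product(range(m), repeat=len(jobs)):
        loads = [0] * m
        for p, i in zip(jobs, assignment):
            loads[i] += p
        best = min(best, max(loads))
    return best


def greedy_makespan(jobs, m):
    """Makespan of the online rule that puts each job on a least loaded machine."""
    loads = [0] * m
    for p in jobs:
        loads[loads.index(min(loads))] += p
    return max(loads)


def worst_prefix_ratio(jobs, m):
    """Largest ratio ALG/OPT over all non-empty prefixes of the job list."""
    return max(
        greedy_makespan(jobs[:t], m) / opt_makespan(jobs[:t], m)
        for t in range(1, len(jobs) + 1)
    )
```

For the jobs 1, 1, 2 on two machines, `worst_prefix_ratio([1, 1, 2], 2)` evaluates to 1.5.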

Only recently, the concept of competitive-ratio approximation schemes was introduced in [18].
Such an approximation scheme is a procedure that computes a nearly optimal online algorithm 
and at the same time provides a nearly exact estimate of the optimal competitive ratio. The 
general definition (without distinguishing by m) is as follows. 

Definition 1. A competitive-ratio approximation scheme computes for a given $\varepsilon > 0$ an online
algorithm ALG with a competitive ratio $\rho_{\mathrm{ALG}} \le (1 + \varepsilon)\rho^*$. Moreover, it determines a value $\rho'$
such that $\rho' \le \rho^* \le (1 + \varepsilon)\rho'$.

In this paper we provide a competitive-ratio approximation scheme for the online-list variant 
of makespan minimization on identical parallel and uniformly related machines for any number 
of machines. This is the first competitive-ratio approximation scheme for a problem in the 
online-list model in contrast to previous work in the so-called online-time model [22]. In the
latter model jobs are revealed to the algorithm online over time at their individual release date. 
Regarding the decision making process, an online algorithm has more freedom in this model, 
as it is allowed to postpone decisions or even revoke them as long as the jobs have not been 
executed. 

1.1 Related Work 

The online-list makespan minimization problem has been studied extensively, mainly on identical
parallel machines. The classical list scheduling algorithm with competitive ratio $2 - 1/m$ [16]
is optimal for $m \in \{2, 3\}$ [13]. For $m \ge 4$ better algorithms have been proposed and improved
general lower bounds were shown in a series of works [2, 4, 6, 13-15, 19, 23]. The currently
best known bounds on the optimal competitive ratio $\rho^*(m)$ for some particular values of m
are $\rho^*(4) \in [1.732, 1.733]$, $\rho^*(5) \in [1.746, 1.770]$, $\rho^*(6) \in [1.773, 1.8]$, ..., $\rho^* \in [1.88, 1.9201]$.

On uniformly related machines the gap is much larger. The lower bound on the competitive 
ratio for an arbitrary number of machines is 2.564 [11], while the currently best known upper
bound is 5.828 [5]. Interestingly, the special case of two related machines is completely solved,
meaning that the exact competitive ratio is known [5] and, even stronger, the exact ratio for any
pair of speeds is known [12, 26].

We also remark that the preemptive variant of the identical machine problem is completely 
solved and the optimal competitive ratio for any number of machines [7] is known. For uniformly 
related machines, an optimal online algorithm is known for any number of machines and any 
combination of speeds [10]. Interestingly, from their linear programming based approach it is
not clear how to derive the actual value of the competitive ratio except for $m \in \{3, 4\}$.

Competitive-ratio approximation schemes were introduced by Günther et al. in [18]. They
focussed on scheduling problems in the online-time model and provide such schemes for various
scheduling problems: $Pm|r_j,(pmtn)|\sum w_jC_j$, $Qm|r_j,(pmtn)|\sum w_jC_j$ (assuming a constant
range of machine speeds without preemption), and $Rm|r_j,pmtn|\sum w_jC_j$. They also consider
minimizing the makespan, $C_{\max}$, and $\sum_{j \in J} w_j f(C_j)$, where f is an arbitrary monomial function
with fixed exponent. Subsequently, Kurpisz et al. [21] showed how to construct competitive-ratio
approximation schemes for $Rm|r_j|C_{\max}$, makespan minimization in a job shop problem,
and scheduling with delivery times, again all in the online-time model.

We are not aware of any publication of similar results for the online-list model.$^1$ Notice
that the results in [10] are conceptually strongly related. The main difference is that our
approximation scheme provides the algorithmic means to compute the actual value of the optimal
competitive ratio (up to some error), whereas this remains open for the algorithm in [10] even
though it is provably optimal.

1.2 Our Results 

In this paper we provide competitive-ratio approximation schemes for the online-list variants 
of makespan minimization on identical parallel and uniformly related machines for any number 
of machines, that is, $Pm||C_{\max}$ and $Qm||C_{\max}$. More precisely, given $\varepsilon > 0$ and an $m \in \mathbb{N}$, we
provide an online algorithm ALG(m) with a competitive ratio $\rho_{\mathrm{ALG}(m)} \le (1+\varepsilon)\rho^*(m)$. Moreover,
it determines a value $\rho'$ such that $\rho' \le \rho^*(m) \le (1 + \varepsilon)\rho'$.

On a high level, we use a similar approach as in [18]. We first simplify and structure the
input without changing the instance too much, and then we reduce the complexity of possible
online algorithms by interpreting them as an algorithm map that bases its decisions only on
the currently unfinished jobs and the schedule history. The key insight is that a very limited
(constant) part of the schedule history is sufficient to take decisions which are close to optimal.
In contrast to the previous work in the online-time model, the amount of history that has to be
considered depends on the size of the currently largest revealed job instead of the time at which
the last jobs were released.

1 However, at the time of writing we were contacted by another group of researchers [8] who
independently obtained results similar to ours.

The main purpose of this paper is to provide a proof of concept for competitive-ratio ap- 
proximation schemes and to show that it is also applicable to online problems in the online-list 
model. The actual gaps between upper and lower bounds on the optimal competitive ratios 
are rather small on identical parallel machines, but this is one of the most classical online-list 
problems. Since many online problems follow the online-list paradigm, we hope that this work 
fosters further research on competitive-ratio approximation schemes. 

Outline of the paper. We first consider identical parallel machines in Section 2. Then we
argue how to extend this technique to uniformly related machines in Section 3. We conclude
with open questions and further research potential.

2 Identical Parallel Machines 

We give a competitive-ratio approximation scheme for the online-list variant of $Pm||C_{\max}$. We
first give some transformations that simplify the input and reduce the structural complexity of
online schedules. Then we use an abstract view on online algorithms to reduce complexity
further and to describe the approximation scheme.

2.1 Restrictions at 1 + ε loss

We will use the terminology that at $1+\varepsilon$ loss we can restrict to instances or schedules with certain
properties. This means that we lose at most a factor $1 + \varepsilon$ in the objective value, as $\varepsilon \to 0$, by
limiting our attention to those instances. We bound several relevant parameters by constants.
If not stated differently, any mentioned constant depends only on $\varepsilon$ and m.

In the online-list model we refer to an iteration for each job arrival. We will slightly abuse 
notation and refer to the iteration in which job j is revealed as iteration j. For an online algorithm 
A, an instance I, and an iteration j, denote by $A_j(I)$ the makespan of the schedule obtained after
iteration j when processing instance I by algorithm A. Furthermore, we define $p(J) := \sum_{j \in J} p_j$.

The first observation has been made already in other contexts, e.g., in [1] for minimizing
$\sum w_j C_j$.

Proposition 2. At $1 + \varepsilon$ loss we can restrict to instances where all $p_j$ are powers of $1 + \varepsilon$.
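
The rounding behind Proposition 2 can be sketched as follows. The proposition does not state the
rounding direction; rounding up is assumed here, which changes every processing time by a factor
of less than $1 + \varepsilon$. The function name `round_up_to_power` is hypothetical.

```python
import math


def round_up_to_power(p: float, eps: float) -> float:
    """Smallest power of (1 + eps), with non-negative exponent, that is at least p >= 1."""
    base = 1.0 + eps
    k = math.ceil(math.log(p) / math.log(base))
    # Correct a possible off-by-one caused by floating-point error in the logarithm.
    while base ** k < p:
        k += 1
    while k > 0 and base ** (k - 1) >= p:
        k -= 1
    return base ** k
```

For instance, with eps = 0.5 a job of size 2 is rounded to 1.5^2 = 2.25.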

In order to simplify the construction of our algorithms, we can actually restrict to instances 
with a very simple and special structure. 

Lemma 3. At $1 + \varepsilon$ loss we can restrict to instances where for each $k \in \mathbb{N}$ there are at most $m/\varepsilon^3$
jobs j with $p_j = (1 + \varepsilon)^k$.

Proof. Suppose that we have an online algorithm A' which achieves a competitive ratio of $\rho_{A'}(m)$
on instances with at most $m/\varepsilon^3$ jobs with processing time $p_j = (1 + \varepsilon)^k$ for each $k \in \mathbb{N}$. Based
on A' we construct an online algorithm A for arbitrary instances (assuming that processing times
are powers of $1 + \varepsilon$) with a competitive ratio of $\rho_A(m) \le (1 + O(\varepsilon))\rho_{A'}(m)$.

Suppose that we are given an (arbitrary) instance I where all $p_j$ are powers of $1 + \varepsilon$. We
construct an instance I' which we present to A'. Based on the schedule A'(I') we construct the
schedule A(I). As long as for each $k \in \mathbb{N}$ at most $m/\varepsilon^3$ jobs j with $p_j = (1 + \varepsilon)^k$ are released, the
instances I and I' are identical and we define A(I) to be identical to A'(I'). Now suppose that
in some iteration j a job j with $p_j = (1 + \varepsilon)^k$ is revealed after $m/\varepsilon^3$ other jobs with the same
processing time have already been released. Let $I_j$ denote the instance up to job j. Observe that
$\mathrm{OPT}(I_j) \ge \frac{1}{\varepsilon^3}(1 + \varepsilon)^k$. Let $p := (1 + \varepsilon)^{\lceil \log_{1+\varepsilon}(\varepsilon^2 \cdot \mathrm{OPT}(I_j)) \rceil} \ge \varepsilon^2 \cdot \mathrm{OPT}(I_j)$.
We observe that so far at most $m/\varepsilon^2 \le m/\varepsilon^3$ jobs of size p have been released since otherwise
$\mathrm{OPT}(I_j) \ge \frac{1}{m}\left(\frac{m}{\varepsilon^2} + 1\right) \cdot p > \frac{1}{m} \cdot \frac{m}{\varepsilon^2} \cdot \varepsilon^2 \, \mathrm{OPT}(I_j) = \mathrm{OPT}(I_j)$. Instead of j, in instance I' we release
a new job j' with $p_{j'} = p$. Suppose that algorithm A' assigns j' to machine i. Then, algorithm
A assigns the next upcoming jobs j'' with $p_{j''} \le p_j$ to i, as long as their total processing time
is bounded by p. More precisely, we define $j_{\max}$ to be the maximum value such that for the
set $J := \{j'' \in I \mid j \le j'' \le j_{\max} \wedge p_{j''} \le p_j\}$ it holds that $p(J) \le p$. We define that algorithm
A assigns all jobs in J to machine i. We call j' a container job. We say that after iteration
$j_{\max}$ the container job j' is full. Intuitively this means that we do not add any further jobs to
j'. At each iteration j'' we say that the jobs in $J \cap J_{j''}$ are in the container j'. Observe that
$p_{j''} \le p_j = (1 + \varepsilon)^k \le \varepsilon^3 \cdot \mathrm{OPT}(I_j) \le \varepsilon \cdot p$ for all $j'' \in J$.

By construction, we observe that for each $k \in \mathbb{N}$ there is at most one container job of size
$(1 + \varepsilon)^k$ which is less than a $(1 - \varepsilon)$-fraction full. In particular, if we create a new container job
of size $p := (1 + \varepsilon)^{\lceil \log_{1+\varepsilon}(\varepsilon^2 \cdot \mathrm{OPT}(I_j)) \rceil}$ then up to iteration j strictly less than $m/\varepsilon^2$ jobs (container
jobs and normal jobs!) of size p have been released since otherwise $\mathrm{OPT}(I_j) > \frac{1}{m}\left(\frac{m}{\varepsilon^2} \cdot p\right) \ge
\frac{1}{m} \cdot \frac{m}{\varepsilon^2} \cdot \varepsilon^2 \, \mathrm{OPT}(I_j) = \mathrm{OPT}(I_j)$. According to the above definition, it can happen that we open a
new container job while an old container with smaller size is not yet a $(1 - \varepsilon)$-fraction full. In
this case we close the smaller old container and do not add any further jobs to it.

To prove the competitive ratio of A we need to show that $\frac{A(I_j)}{\mathrm{OPT}(I_j)} \le (1 + O(\varepsilon)) \frac{A'(I'_j)}{\mathrm{OPT}(I'_j)} \le
(1 + O(\varepsilon))\rho_{A'}(m)$. By construction we have that $A(I_j) \le A'(I'_j)$ since A and A' assign the jobs in
$J(I_j) \cap J(I'_j)$ to the same machines and on each machine i the total processing time of the jobs in
$J(I_j) \setminus J(I'_j)$ is bounded from above by the total processing time of the jobs in $J(I'_j) \setminus J(I_j)$ (the
container jobs) on this machine. It remains to show that $\mathrm{OPT}(I'_j) \le (1 + O(\varepsilon))\mathrm{OPT}(I_j)$. Based
on $\mathrm{OPT}(I_j)$ we construct a schedule S for $I'_j$ whose makespan is bounded by $(1 + O(\varepsilon))\mathrm{OPT}(I_j)$.
In S, we assign all jobs in $J(I_j) \cap J(I'_j)$ to the same machine as in $\mathrm{OPT}(I_j)$. Then, we assign
the jobs in $J(I'_j) \setminus J(I_j)$ (the container jobs) greedily. If after the greedy assignment the global
makespan does not change, then $\mathrm{OPT}(I'_j) \le \mathrm{OPT}(I_j)$.

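Now suppose that after the greedy assignment the global makespan increases. Then the load
of any two machines can differ by at most $\bar{p}$, which denotes the maximum processing time of a
container job in $I'_j$. Note that $\bar{p} \le (1 + \varepsilon)^{\lceil \log_{1+\varepsilon}(\varepsilon^2 \cdot \mathrm{OPT}(I'_j)) \rceil}$ and observe that the makespan of S
is upper-bounded by $\frac{1}{m} \cdot p(I'_j) + \bar{p}$. Since for each $k \in \mathbb{N}$ there is at most one container job of
size $(1 + \varepsilon)^k$ which is less than a $(1 - \varepsilon)$-fraction full we further conclude that

\begin{align*}
\mathrm{OPT}(I'_j) &\le \frac{1}{m}\, p(I'_j) + \bar{p} \\
&\le \underbrace{\frac{1}{m}(1 + O(\varepsilon))\, p(I_j)}_{\text{normal jobs and $(1-\varepsilon)$-full container jobs}} + \bar{p}
  + \underbrace{\sum_{k'=1}^{\lceil \log_{1+\varepsilon}(\varepsilon^2 \cdot \mathrm{OPT}(I'_j)) \rceil} (1 + \varepsilon)^{k'}}_{\text{less than $(1-\varepsilon)$-full container jobs}} \\
&\le \frac{1}{m}(1 + O(\varepsilon))\, p(I_j) + 2(1 + \varepsilon)^{\lceil \log_{1+\varepsilon}(\varepsilon^2 \cdot \mathrm{OPT}(I'_j)) \rceil}
  + \sum_{1 \le k' \le \log_{1+\varepsilon}(\varepsilon^2 \cdot \mathrm{OPT}(I'_j))} (1 + \varepsilon)^{k'} \\
&\le \frac{1}{m}(1 + O(\varepsilon))\, p(I_j) + 2(1 + \varepsilon)^{\lceil \log_{1+\varepsilon}(\varepsilon^2 \cdot \mathrm{OPT}(I'_j)) \rceil}
  + \frac{1}{\varepsilon}(1 + \varepsilon)^{1 + \log_{1+\varepsilon}(\varepsilon^2 \cdot \mathrm{OPT}(I'_j))} \\
&\le \frac{1}{m}(1 + O(\varepsilon))\, p(I_j) + 2(1 + \varepsilon)\,\varepsilon^2 \cdot \mathrm{OPT}(I'_j)
  + \frac{1 + \varepsilon}{\varepsilon}\left(\varepsilon^2 \cdot \mathrm{OPT}(I'_j)\right) \\
&\le (1 + O(\varepsilon))\, \mathrm{OPT}(I_j) + O(\varepsilon) \cdot \mathrm{OPT}(I'_j),
\end{align*}

which implies that $\mathrm{OPT}(I'_j) \le (1 + O(\varepsilon))\mathrm{OPT}(I_j)$. □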

2.2 Online Algorithms and Algorithm Maps 

As in [18] we use an abstract characterization of an online algorithm and interpret it as a map.
The map gets as input the so far computed schedule and the size of the next released job j. 
Based on these data, it decides to which machine it assigns j. 
To this end, we define a configuration C as follows.

Definition 4. A configuration C is the combination of 

• a set J(C) of previously released jobs, including their order, 

• a map $\chi_C : J(C) \to \{1, \ldots, m\}$ which defines the assignment of the jobs in J(C) to the
machines,

• the processing time $p_{j^*}$ of the newly released but not yet assigned job $j^*$.

We will write only J or $\chi$ (rather than J(C) or $\chi_C$) when C is clear from the context.
Let $\mathcal{C}$ denote the (infinite) set of configurations. We say that a configuration C is in phase k
if $\max_{j \in J \cup \{j^*\}} p_j = (1 + \varepsilon)^k$. Let C be a configuration in phase k. We call a job $j \in J(C)$
relevant if $p_j \ge (1 + \varepsilon)^{k-s}$, where $s \in \mathbb{N}$ is the smallest integer such that $s \ge \log_{1+\varepsilon} \frac{m(1+\varepsilon)}{\varepsilon^5}$ (note
that s depends only on $\varepsilon$ and m and is independent of C). Denote by $J_R(C) \subseteq J(C) \cup \{j^*\}$ all
relevant jobs for a configuration C. It will turn out later that at $1 + \varepsilon$ loss we can neglect the jobs
which are not relevant, which we call irrelevant. Define by $MS(C) := \max_i p(\chi_C^{-1}(i))$ the
makespan of C. Also, we define $\mathrm{OPT}(C) := \mathrm{OPT}(J(C))$ (note here that J(C) does not include
the newly released job $j^*$).
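
The notions of phase and relevance can be illustrated with a short sketch. It assumes that every
job is stored by its exponent e (so its processing time is $(1+\varepsilon)^e$) and that s is the smallest
integer with $(1+\varepsilon)^s \ge m(1+\varepsilon)/\varepsilon^5$; the function names are hypothetical.

```python
import math


def history_depth(eps: float, m: int) -> int:
    """Smallest integer s with s >= log_{1+eps}( m (1 + eps) / eps^5 )."""
    return math.ceil(math.log(m * (1 + eps) / eps ** 5) / math.log(1 + eps))


def relevant_jobs(assigned_exps, new_exp, s):
    """Return the phase k and the exponents of assigned jobs with p_j >= (1+eps)^(k-s)."""
    # The phase is determined by the largest job seen so far, including the new job j*.
    k = max(list(assigned_exps) + [new_exp])
    return k, [e for e in assigned_exps if e >= k - s]
```

For eps = 0.5 and m = 2 this gives s = 12, so only jobs within twelve size classes below the
current maximum are kept in the description of a configuration.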

We interpret an online algorithm for our problem on m machines as a map $f : \mathcal{C} \to \{1, \ldots, m\}$:
Given a configuration C with a newly released job $j^*$, the algorithm map f assigns $j^*$ to the
machine f(C). As for online algorithms, we denote by $\rho_f(m)$ the competitive ratio obtained
by the map f.

Proposition 5. For each online algorithm A for the problem on m machines there is an algorithm
map f such that $\rho_f(m) = \rho_A(m)$.

Definition 6. Let C, C' be two configurations which are in phases k and k', respectively. They
are equivalent if there is a bijection $\sigma : J_R(C) \to J_R(C')$ such that

• $p_{\sigma(j)} = (1 + \varepsilon)^{k'-k} \cdot p_j$,

• $\chi_C(j) = \chi_{C'}(\sigma(j))$ for all $j \in J_R(C) \setminus \{j^*\}$, and

• $\sigma(j^*) = j'^*$ if $j^* \in J_R(C)$, where $j'^*$ denotes the newly released job in C'.
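
One way to test equivalence in practice is to map every configuration to a canonical key that is
invariant under shifting the phase. The sketch below is an illustration under assumptions: jobs
are stored as (exponent, machine) pairs, s is the relevance threshold, and boundary cases in which
only one of the two newly released jobs is relevant are not treated specially. The function name
`equivalence_key` is hypothetical.

```python
from collections import Counter


def equivalence_key(assigned, new_exp, s):
    """Canonical key of a configuration.

    assigned: list of (exponent, machine) pairs of already assigned jobs
    new_exp:  exponent of the newly released, not yet assigned job j*
    s:        relevance threshold (jobs below phase - s are ignored)
    """
    k = max([e for e, _ in assigned] + [new_exp])  # phase of the configuration
    # Relevant assigned jobs, described by their distance to the phase and their machine.
    relevant = Counter((k - e, i) for e, i in assigned if e >= k - s)
    # The new job enters the key only if it is relevant itself.
    new_offset = k - new_exp if new_exp >= k - s else None
    return frozenset(relevant.items()), new_offset
```

Two configurations with equal keys admit a bijection as in the definition: relevant jobs are
matched by identical size offset and machine, and the irrelevant jobs do not influence the key.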

Proposition 7. There are only constantly many equivalence classes of configurations. 

In Definition 6 we neglect the jobs which are not relevant. This is justified by the following
lemma.

Lemma 8. Let C be a configuration for a phase k. Then $p(J(C) \setminus J_R(C)) \le \varepsilon \cdot (1 + \varepsilon)^k \le
\varepsilon \cdot \mathrm{OPT}(C) \le \varepsilon \cdot MS(C)$.

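Proof. Recall that by Lemma 3 we assumed that for each $k' \in \mathbb{N}$ there are at most $m/\varepsilon^3$ jobs j
with $p_j = (1 + \varepsilon)^{k'}$. Hence, the total processing time of irrelevant jobs in J(C) is bounded by

\begin{align*}
p(J(C) \setminus J_R(C)) &\le \sum_{1 \le k' \le k-s} \frac{m}{\varepsilon^3}(1 + \varepsilon)^{k'}
  \le \frac{m}{\varepsilon^3} \cdot \frac{(1 + \varepsilon)^{k-s+1}}{\varepsilon}
  = (1 + \varepsilon)^k \cdot \frac{m(1 + \varepsilon)^{1-s}}{\varepsilon^4} \\
&\le \varepsilon (1 + \varepsilon)^k,
\end{align*}

where the last inequality follows since $\frac{m(1+\varepsilon)^{1-s}}{\varepsilon^4} \le \varepsilon$ by definition of s. Since C is a configuration
in phase k, by definition a job j with $p_j = (1 + \varepsilon)^k$ must have been released. It follows
that $\varepsilon \cdot (1 + \varepsilon)^k \le \varepsilon \cdot \mathrm{OPT}(C) \le \varepsilon \cdot MS(C)$. □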
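
The geometric-sum bound in this proof can be checked numerically. The sketch assumes at most
$m/\varepsilon^3$ jobs per size class and the threshold s with $(1+\varepsilon)^s \ge m(1+\varepsilon)/\varepsilon^5$; the function name
is hypothetical.

```python
import math


def irrelevant_volume_bound(eps: float, m: int, k: int):
    """Return (worst-case volume of irrelevant jobs, eps * (1 + eps)^k) for phase k."""
    s = math.ceil(math.log(m * (1 + eps) / eps ** 5) / math.log(1 + eps))
    # At most m / eps^3 jobs in each size class k' = 1, ..., k - s.
    total = sum(m / eps ** 3 * (1 + eps) ** kp for kp in range(1, k - s + 1))
    return total, eps * (1 + eps) ** k
```

For every phase k the first returned value should not exceed the second one, which is the
inequality claimed in the lemma.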

In particular, two equivalent configurations have almost the same makespan and their re- 
spective jobs have almost the same optimal makespan. 

Lemma 9. Let C, C' be two equivalent configurations for phases k and k', respectively. Then
$MS(C') \le (1 + O(\varepsilon))(1 + \varepsilon)^{k'-k} \cdot MS(C)$ and $\mathrm{OPT}(C') \le (1 + O(\varepsilon))(1 + \varepsilon)^{k'-k} \cdot \mathrm{OPT}(C)$.

Proof. Follows from Definition 6 and Lemma 8. □

Lemma 10. At $1 + \varepsilon$ loss we can restrict to instances I such that for each iteration j we have
that $p_j \ge \max_{j' < j} p_{j'} \cdot (1 + \varepsilon)^{-s}$.

Proof. When in some iteration j a job j is released with $p_j < \max_{j' < j} p_{j'} \cdot (1 + \varepsilon)^{-s}$ we assign j
to some arbitrary machine. By Lemma 8 the total processing time of such jobs, released up to
some iteration j', is bounded by $\varepsilon \cdot \mathrm{OPT}(I_{j'})$. □

Lemma 11. At $1 + \varepsilon$ loss we can restrict to algorithm maps f such that f(C) = f(C') for any
two equivalent configurations C and C'.

Proof. Let f be an algorithm map without this property with a competitive ratio of $\rho_f(m)$ on m
machines. Based on f we construct a new algorithm map g with the claimed property such that
$\rho_g(m) \le (1 + \varepsilon)\rho_f(m)$.

We call a configuration C realistic for f if there is an instance I such that f ends in
configuration C when processing I. For each equivalence class $\mathcal{C}' \subseteq \mathcal{C}$ of configurations containing
at least one realistic configuration, we pick a realistic representative $C \in \mathcal{C}'$. We say that C
represents $\mathcal{C}'$. For all configurations $C' \in \mathcal{C}'$ equivalent to C we define g such that g(C') = f(C).

We claim that g is always in a configuration C such that there is a configuration C' with
C ∼ C' such that C' is realistic for f. We prove the claim by induction over the iterations. We
start with the base case of zero previous iterations. Let C be a configuration which is realistic for
g. Since so far no jobs have been scheduled, C is also realistic for f. Now suppose that the claim
is true when the first $\ell$ jobs have been released. Suppose that after $\ell$ jobs have been released g
is in a configuration C for a phase k such that some configuration C' for phase k' with C ∼ C'
is realistic for f. Assume w.l.o.g. that C' represents its equivalence class. Assume that in C a
job $j^*$ with processing time $p_{j^*}$ is released and denote by $j'^*$ the newly released job in C'.

By construction, g assigns $j^*$ to the machine g(C) = f(C'). Then a new (relevant) job j is
released which yields the configuration $\bar{C}$. Denote by $\bar{C}'$ the configuration which results from C'
after assigning job $j'^*$ to machine f(C') and the release of a new job j' with $p_{j'} = (1 + \varepsilon)^{k'-k} p_j$.
Since j is relevant for $\bar{C}$, it follows that j' is relevant for $\bar{C}'$. Since C ∼ C' and $p_{j'} = (1 + \varepsilon)^{k'-k} p_j$
we conclude that $\bar{C} \sim \bar{C}'$. □

The decision of an algorithm map (with all above simplifications) for a configuration C
depends only on the equivalence class of C. Since there are only constantly many equivalence
classes for configurations (see Proposition 7) and for each configuration there are only m possible
decisions, there are only constantly many algorithm maps. Hence, we can enumerate them all.
With the procedure given by the following lemma we estimate the competitive ratio of each map.
Finally, we output the map with the minimum estimated competitive ratio.
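
The enumeration step can be sketched as a product over equivalence classes. The list `classes`
of class identifiers and the callback `estimate_ratio` (standing in for the estimation procedure
of the following lemma) are assumptions of this illustration, and the number of maps,
$m^{|\text{classes}|}$, is far too large for anything but toy inputs.

```python
from itertools import product


def enumerate_maps(classes, m):
    """Yield every map from equivalence classes to machines 0, ..., m-1 as a dict."""
    for choice in product(range(m), repeat=len(classes)):
        yield dict(zip(classes, choice))


def best_map(classes, m, estimate_ratio):
    """Return a map with the smallest estimated competitive ratio.

    estimate_ratio is a user-supplied function taking a map (dict) and returning a number.
    """
    return min(enumerate_maps(classes, m), key=estimate_ratio)
```

With three classes and two machines, `enumerate_maps` yields 2^3 = 8 maps.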

Lemma 12. Let f be an algorithm map for m machines. There exists an algorithm which
computes a value $\rho$ with $\rho_f(m) \le \rho \le (1 + \varepsilon)\rho_f(m)$.

Proof. In order to determine $\rho_f(m)$ it is sufficient to know all possible realistic configurations
for f. By Lemma 9 the realistic configuration C with the worst competitive ratio determines
$\rho_f(m)$ up to a factor of $1 + \varepsilon$. □

Combining all statements gives our main theorem. 

Theorem 13. There is a competitive-ratio approximation scheme for the online-list variant of
the problem $Pm||C_{\max}$ for any number of machines m.

3 Uniformly Related Machines 

With small additional instance transformations we can apply a similar construction in the setting 
of uniformly related machines. By scaling processing times and machine speeds, we can assume 
w.l.o.g. that the slowest machine has unit speed. Let $s_{\max}$ denote the speed of the fastest machine
in a given instance.

Proposition 14. At $1 + \varepsilon$ loss we can assume that the speed of each machine is a power of $1 + \varepsilon$.

Lemma 15. At $1 + \varepsilon$ loss, we can restrict to instances in which $s_{\max}$ is bounded by $m/\varepsilon$.

Proof. Take a given schedule with makespan MS on related machines with speed values $s_1, \ldots, s_{\max}$.
For each machine whose speed is at most $\frac{\varepsilon}{m} \cdot s_{\max}$, we take the jobs assigned to it and add them
to the fastest machine. The moved processing volume increases the total processing volume on
the fast machine by at most $\varepsilon \cdot s_{\max} \cdot MS$, so its completion time grows by at most $\varepsilon \cdot MS$. Thus,
we can simply ignore machines whose speed is at most $\frac{\varepsilon}{m} \cdot s_{\max}$. The remaining machines have
speeds in the range $[\frac{\varepsilon}{m} \cdot s_{\max}, s_{\max}]$. Since we assume that the slowest machine has unit speed,
after rounding the speeds we have that $s_{\max} \le m/\varepsilon$. □
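
The machine-pruning step of this proof can be sketched as follows. The sketch drops every
machine whose speed is at most $(\varepsilon/m) \cdot s_{\max}$ and rescales so that the slowest remaining machine
has unit speed; rounding speeds to powers of $1 + \varepsilon$ is omitted, and the function name is
hypothetical.

```python
def prune_slow_machines(speeds, eps):
    """Drop machines with speed <= (eps / m) * s_max and normalize the slowest kept one to 1."""
    m = len(speeds)
    s_max = max(speeds)
    threshold = eps / m * s_max
    kept = [s for s in speeds if s > threshold]  # the fastest machine is always kept
    slowest = min(kept)
    return [s / slowest for s in kept]
```

After pruning, the largest normalized speed is below m/eps; for speeds 1, 50, 100 and eps = 0.5
the slow machine is dropped and the remaining speeds become 1 and 2.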

Hence, for each value m there are only finitely many speed vectors $s_1, \ldots, s_m$. For each of
these speed vectors, we can bound the number of jobs of the same size similarly to Lemma 3 with
an additional dependence on $s_{\max} \le m/\varepsilon$. This allows us to define configurations, algorithm
maps, and equivalence relations similarly as in the previous section.

Theorem 16. There is a competitive-ratio approximation scheme for the online-list variant of
the problem $Qm||C_{\max}$ for any number of machines m.



4 Conclusion and Further Research 

We provide competitive-ratio approximation schemes for the makespan minimization problem 
when jobs arrive online over a list. This proves that the concept of competitive-ratio approxi- 
mation schemes is not limited to online (scheduling) problems in the online-time model. 

The approximation schemes presented in this paper compute a nearly optimal solution for
any number of machines. On the theoretical side, it would be interesting to give a general
approximation of the optimal competitive ratio over all possible numbers of machines m. This
requires a better understanding of how $\rho^*(m)$ behaves as a function of m. We know that it is
bounded from above (for every m). Intuitively, and as the known bounds suggest, it seems to be
increasing in m.

Our approximation schemes not only determine nearly best possible online algorithms,
they also provide the algorithmic tools to compute the value of the optimal competitive ratio
up to any desired accuracy. This is interesting because it contrasts with the common approach of
deriving upper and lower bounds on the (optimal) competitive ratio manually. In particular,
our theory proves that a computer may execute the algorithm to compute the desired bounds.
However, the drawback of our presented construction is its computational complexity. To reduce
the gaps between the currently best known upper and lower bounds, we would have to choose
a quite small accuracy parameter $\varepsilon$, which leads to a hopeless running time. We believe that a
more careful design of the necessary input simplification and algorithm structuring might lead
to approximation schemes that can explicitly compute improved bounds.

Current research on competitive-ratio approximation schemes has focussed on particular online
scheduling problems. Our vision is to use insights for particular problems to eventually
characterize general properties of online problems that allow for a competitive-ratio
approximation scheme.